10110-10123 Nucleic Acids Research, 2013, Vol. 41, No. 22 
doi:10.1093/nar/gkt778 



Published online 2 September 2013 



Antisense transcripts enhanced by camptothecin 
at divergent CpG-island promoters associated 
with bursts of topoisomerase l-DNA cleavage 
complex and R-loop formation 

Jessica Marinello 1 , Giovanni Chillemi 2 , Susana Bueno 2 , Stefano G. Manzo 1 and 
Giovanni Capranico 1 '* 

department of Pharmacy and Biotechnology, University of Bologna, via Irnerio 48, 40126 Bologna, Italy and 
2 CASPUR, via dei Tizii 6, Rome 00185, Italy 



Received April 19, 2013; Revised and Accepted August 6, 2013 



ABSTRACT 

DNA Topoisomerase I (Topi) is required to relax 
DNA supercoils generated by RNA polymerases 
(RNAPs). Topi is inhibited with high specificity by 
camptothecin (CPT), an effective anticancer agent, 
and by oxidative base damage and ribonucleotides 
in DNA strands, resulting into Top1-DNA cleavage 
complexes (Toplccs). To understand how Toplccs 
affect genome stability, we have investigated the 
global transcriptional response to CPT-induced 
Toplccs. Toplccs trigger an accumulation of anti- 
sense RNAPII transcripts specifically at active 
divergent CpG-island promoters in a replication- 
independent and Topi -dependent manner. As CPT 
increases antisense transcript levels in the pres- 
ence of 5,6-dichloro-1-beta-D-ribofuranosylben- 
zimidazole, a transcription inhibitor, Toplccs likely 
impair antisense RNA degradation. Time-course 
data showed a burst of Toplccs increased by CPT 
at promoter sites and along transcribed regions, 
causing a transient block of RNAPII at the 
promoter. Moreover, cell immunofluorescence 
analyses showed that Toplccs induce a transient 
increase of R-loops specifically at highly transcribed 
regions such as nucleoli in a Topi -dependent 
manner. Thus, a specific and highly dynamic tran- 
scriptional response to Toplccs occurs at divergent 
active CpG-island promoters, which may include a 
transient stabilization of R-loops. The results clarify 
molecular features of a response pathway leading to 



transcription-dependent genome instability and 
altered transcription regulation. 

INTRODUCTION 

Transcription by RNA polymerase II (RNAPII) is 
controlled by a regulated interplay of DNA transactions 
and factors controlling cell identity. The complexity of 
transcription regulation is increased by the recent know- 
ledge that divergent transcription is a common feature of 
gene promoters in several organisms, generating short and 
long (>200bp) non-coding RNAs (1-6). Moreover, DNA 
conformation and superhelicity can affect transcription 
and transcription regulation (1-6). On one hand, negative 
and positive supercoils of DNA template can impair tran- 
scription in living cells (4,6), whereas on the other, DNA 
supercoiling has a basic function in transcription regula- 
tion in prokaryotes (2) and during development and patho- 
logical states of eukaryotic cells (1,3,7). Interestingly, 
negatively supercoiled DNA can form non-B structures, 
such as R-loops, that can impact transcription-associated 
recombination and mutation, Ig gene class-switch recom- 
bination and transcription regulation (8-11). 

DNA topoisomerase I (Topi) modulates DNA super- 
coiling during fundamental DNA metabolic pathways (12) 
and is the highly specific target of camptothecins (CPT), 
highly effective agents against human cancers (13,14). 
Moreover, drug therapeutic potential has recently been 
broadened, as CPT derivatives can unsilence the silent 
allele of Ube3a in a mouse model of human Angelman 
syndrome (15), suggesting a significant impact of Topi 
inhibitors on neurological disorders. Topi can remove 
DNA supercoils by cleaving a strand of a DNA duplex 
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and forming a transient DNA-Topl cleavage complex 
(Toplcc) in which the cut strand is covalently linked to 
the protein. Then, Topi allows the rotation of the cut 
strand around the intact one and eventually reseals the 
strand break (16,17). CPT interacts at the cleavage site 
with DNA and the enzyme impeding the break resealing 
reaction and stabilizing Top Ices (18). Top lecs are highly 
reversible both in vitro and in living cells, and CPT 
analogs act to increase significantly their half-life (2min 
on average) resulting in a decrease of the DNA uncoiling 
rate by the enzyme (19). In living cells, CPT action results 
in an immediate increase of Topi -mediated DNA breaks, 
and inhibition of Topi activity, transcription and replica- 
tion (17,19-21). The latter drug effect is likely due to col- 
lisions of Top lecs with advancing replication forks that 
can result into frank DNA double-stranded breaks. The 
replicative DNA breakage eventually triggers cell cycle 
arrest and apoptosis of cancer cells (17,21). However, 
the molecular bases of Toplcc interference with transcrip- 
tion regulation and induction of genome instability are not 
fully clarified (13). 

According to the 'twin-supercoiling domain model', 
RNAP elongation can lead to local accumulation of 
positive supercoils ahead of the RNAP and negative super- 
coils behind (5,21). The bulk of cellular Topi activity is 
present at transcribing regions, where its activity can be 
redundant with Top2, as both enzymes can relax either 
positive and negative supercoils (5). However, different so- 
lutions for supercoil removal and specialized roles of each 
enzyme can be operative in cells as indicated by findings in 
yeast where Top2 is mainly responsible for positive super- 
coil removal in front of RNAPs and Topi removes 
negative supercoils behind the polymerase (22,23). Early 
works provided evidence that Topi also functions at the 
transcription initiation step (24-26). Topi is likely an 
enzyme that can more rapidly respond to local torsional 
tension in nucleosome-free regions (4,27), a molecular en- 
vironment common to promoter regions where Topi has 
been mapped (22,28). In addition, Topi has a role in chro- 
matin organization and remodeling, as the disassembly/ 
assembly of nucleosomes is thought to be another main 
source of DNA supercoiling (29). Topi has been shown 
to affect chromatin organization at yeast telomeric 
regions (30) and active promoters (22,28,31). 

In recent years, we showed that CPT-stabilized Top lecs 
lead to hyperphosphorylation of RNAPII and decreased 
recruitment at human HIF-la, cMyc and GAPDH gene 
promoters (28,32). Unexpectedly, we found that Toplccs 
favor RNAPII escape from promoter-proximal pausing 
sites and increase the levels of antisense transcripts at 
the 5'- and 3'-ends of the human HIFla gene (31). The 
two antisense RNAs overlap with the primary sense tran- 
script of the HIF-la gene. Even though their functions are 
not fully known, we showed that the 5' antisense HIF-la 
RNA, corresponding to the first exon region, localizes 
mainly at the nuclear membrane close to the nuclear 
pore protein Nup62 suggesting a role in the transport 
and post-transcriptional regulation of mRNAs (33). 
Antisense transcripts accumulate for a 4-h period at the 
human HIF-la locus, suggesting that Toplccs can cause a 
specific transcriptional stress at active promoters. 



Nevertheless, the nature and generality of increased anti- 
sense transcripts at gene 5'-ends were left unresolved. 

Here, we investigated the molecular response to Toplccs 
at human gene promoters by next-generation sequencing of 
cellular transcripts. We show that CPT-stabilized Toplccs 
cause the accumulation of antisense RNAPII transcripts 
specifically at divergent CpG-island (CGI) promoters, 
mainly upstream of the transcriptional start site (TSS), in 
a replication-independent and Topi -dependent manner. 
In addition, Toplcc levels increase transiently at active 
promoters during CPT treatment, and parallel a transient 
RNAPII block and increase of R-loops in living cells. The 
findings highlight molecular events at specific active pro- 
moters that can trigger transcription-associated recombin- 
ation and transcription-associated mutation and set the 
basis to understand the functions of antisense transcripts 
in gene regulation and genome instability. 

MATERIALS AND METHODS 

Cell lines 

The cancer cell lines HCT1 16, PC3, U20S and COLO205 
were purchased from ATCC (LGC Standards S.r.l., 
Milan, Italy) and were grown in Dulbecco's modified 
Eagle's medium (HCT116) or RPMI (PC3, U20S and 
COLO205) mediums with 10% fetal bovine serum 
(M-Medical S.r.l., Milan, Italy). HCT116-shRNATopl 
cell line was gently provided by Y. Pommier (NCI, 
Bethesda, MD, USA) and was grown as HCT116 cells 
but in presence of 200ug/ml Hygromycin B. The cell 
line stably expresses a short hairpin RNA targeting exon 
17 in the TOPI gene (34,35). Cells were maintained at 
37°C in a humidified incubator containing 20% 0 2 and 
5% C0 2 . Cell line identity was certified with Cell ID 
System (Promega) by BMR Genomics Sri (Padova, Italy). 

Cell treatment and RNA extraction 

Exponentially growing cells were exposed to 10 uM of CPT, 
Topotecan (TPT), Etoposide or carboxylate forms of CPT, 
for 4h at 37°C, unless specified otherwise. In case of co- 
treatment, cells were incubated with aphidicolin (5 uM) or 
5,6-dichloro- 1 -beta-D-ribofuranosylbenzimidazole (DRB) 
(50 uM) for 15min, then CPT (10 uM) was added to the 
medium for further 4 h. For CDK9 silencing experiments, 
cells were transfected 24 h after plating (30% confluence) 
with RNAiMax Transfection reagent (Life Technologies) 
and 20 nM of scramble siRNA or CDK9 siRNA (Life 
Technologies) and exposed to drugs 72 h post-transfection. 
For transient Topi silencing experiments, cells were trans- 
fected in the same way using 60 nM of scramble siRNA or 
Topi siRNA (targeting exon 16 of the TOPI gene; Life 
Technologies) and exposed to drugs 72 h post-transfection. 
Total cellular RNA was purified with the acid phenol 
method (33), quantified by ultraviolet absorbance and 
quality controlled by electrophoresis. 

RNA bisulfite conversion and sequencing 

RNA bisulfite treatment was performed according with 
He et al. (36), with minor modifications (Supplementary 
Information). RNA was converted to cDNA using 
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random octamers and Superscript III Reverse 
Transcriptase (Life Technologies). Libraries were 
prepared from cDNA with mRNA-Seq Sample Prep Kit 
(Illumina) following Illumina's standard protocols. RNA 
sequencing was performed on Illumina HiSeq2000 
platform (pair-end 2 XI 00 bp, fiowcell v3, two samples 
for each lane) at IGA Technology Services S.r.l. (Udine, 
Italy). The RNA-Seq data have been submitted to the 
SRA database (http://www.ncbi.nlm.nih.gov/sra) with 
identifier PRJNA186590. 

Bioinformatics analysis 

Two independent RNA-seq experiments for each cell line 
were performed with similar results, and here, we report 
the results of analyses of the pooled data. Low-quality 
sequenced bases were filtered out using DynamicTrim 
v. 1.12 (37), which trims the reads to leave only the 
longest contiguous segment that surpasses the quality 
threshold (P = 0.05). The bisulfite-treated paired reads 
were then mapped twice to the GRCh37 human genome 
assembly in which all the cytosines were mutated in 
thymines (in the following named as CT-GRCh37) or all 
the guanine in adenine (GA-GRCh37) to identify the 
strand originating the sequence fragment. Sequences 
coming from positive strand transcripts align to the CT- 
GRCh37, whereas negative strand trascripts align to the 
GA-GRCh37. The read alignment was carried out with 
Bowtie vO.12.7 (38) and TopHat vl.4.1 (39) to identify 
known transcripts. For each of the four experiments, 
reads that aligned to both CT-GRCh37 and GA- 
GRCh37 were discarded. Cufflinks was used to assemble 
and identify novel transcripts, using reference genomes in 
which all the RefSeq genes in the + or — strands were alter- 
natively masked. Gene expression levels were estimated in 
FPKM units (expected number of Fragments Per Kilobase 
of transcript sequence per Millions of sequenced nucleo- 
tides) using Cufflinks (40), and differential transcript ex- 
pression between control and CPT-treated samples was 
then computed using Cuffdiff. 

Plots of the RNA fragments distributions with respect to 
the TSS (both sense and antisense) were calculated on a 
subset of 9665 not overlapping RefSeq genes, composed 
by 6626 and 3039 genes with and without CpG islands, 
respectively, in the region — 4000/+1000 bp as respect to 
the TSS. Cumulative plots were normalized for total 
aligned tags and number of genes in the respective class. 
The number of reads in the — 4000/+1000 bp region for 
each not overlapping RefSeq gene is reported as box-plot 
of the difference between control and CPT-treated cells 
(in the following A antisense reads and A sense reads). 

A single sample Wilcoxon signed rank test was used 
to estimate the central tendency of the A sense and A 
antisense reads groups (pseudo-median) with a 95% con- 
fidence interval (41). Moreover, two sample comparisons 
were performed using Wilcoxon rank sum tests to evaluate 
if couples of A antisense or A sense reads distributions 
were statistically different, one versus the other. 

K + SDS precipitation assay 

The K + SDS Precipitation protocol was described already 
(30). Briefly, cells were lysed with 1% SDS and DNA 



sonicated (Bioruptor®, Diagenode) to an average 
fragment size of 200-300 bp. Proteins were then 
precipitated with 2.5 M KC1 at 4°C and centrifugation at 
1200g for 3min. Pellets were washed five times with 
0.25 M KC1, and then treated with Proteinase K. DNA 
was purified by phenol-extraction and ethanol precipita- 
tion and quantified by real-time PCR. 

ChIP assay 

ChIP protocol was as previously described (28) with some 
modifications (see Supplementary Information). 

ICE assay 

ICE assay was performed as previously reported (42) with 
minor modifications (see Supplementary Information). 

Immunofluorescence 

HCT116 and HCT116-siRNATopl cells were seeded at 
low density and treated with 10 uM CPT 24 h after 
seeding. Cells were fixed and permeabilized in pure ice- 
cold methanol and incubated with standard protocols 
first with primary antibody (S9.6 antibody gently fur- 
nished by F. Chedin - University of California, Davis), 
and finally with secondary antibody for signal detection 
(Anti-mouse Ig, fluorescein linked whole antibody from 
sheep, Amersham Biosciences). Images acquisition by 
Nikon Ti-E microscopy equipped with AIR confocal 
system. Fluorescence quantitation analysis has been per- 
formed using ImageJ software with the following formula: 
Corrected Total Cell Fluorescence = Integrated Density — 
(Area of selected cell X Mean Fluorescence of 
Background Readings). 

RESULTS 

Toplccs increase antisense sequence tags at active CGI 
promoters in human cells 

To establish the genome-wide effects of CPT-stabilized 
Toplccs at promoters, we mapped RNA-seq paired tags 
following 4-h CPT (10 uM) treatment of human colon 
cancer HCT1 16 cells. We also performed RNA-seq experi- 
ments in HCT116-shRNATopl cells that have Topi 
contents reduced to ~25% of wild-type cell levels (31,34) 
to directly determine the role of Topi. Paired tags were 
obtained from total cellular RNA depleted of ribosomal 
RNAs and treated with bisulfite to maintain the informa- 
tion of strand direction (Supplementary Data, 'Materials 
and Methods' section). A total of 2.18-2.89 x 10 7 high- 
quality paired tags mapped uniquely to the human 
genome, with 43.4-45.0% of them mapping to the minus 
strand (Supplementary Table SI). Although a large 
portion of reads (43.8-49.3%) mapped to genomic 
regions without gene annotations both in control and 
treated cells (Supplementary Table SI), we have here 
focused on reads and transcripts mapping at gene pro- 
moters. First, we evaluated mRNA expression patterns 
altered by CPT, and determined 1054 and 15 Refseq tran- 
scripts with statistically significant variations (>2-fold) in 
HCT116 and HCT116-shRNATopl cells, respectively. 
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The fraction of downregulated genes was higher than 
upregulated genes (69 against 31% in HCT116 cells). 
The results of Gene Set Enrichment analyses of CPT- 
altered genes with DAVID Bioinformatics Resources 
(43,44) were highly consistent with CPT mode of action 
(Supplementary Figure SI and Supplementary Data Set 1) 
and previous findings (34,45), thus showing that we can 
readily measure significant variations in transcript levels. 

As CPT-stabilized Top Ices affected the overall ratio of 
sense/antisense tags (Supplementary Table SI), we then 
evaluated whether the balance of sense and antisense tran- 
scripts was changed at specific gene regions. By analyzing 
tag distributions along annotated Refseq genes, we found 
evidence that the number of antisense tags was specifically 
and markedly increased by Top lecs in a region immediately 
upstream of TSS and in a manner dependent on gene ex- 
pression levels (Supplementary Figure S2A). Sense tags 
showed some changes at the 5'- and 3'-gene regions while 
decreasing somewhat in the body of genes (Supplementary 
Figure S2A). As an increase of antisense reads upstream of 
TSS could be due to a modification of expression of an 
overlapping gene in a divergent orientation, we then 
determined antisense tag distributions at non-overlapping 
Refseq gene promoters and their dependence on promoter 
features, such as the presence of a CGI or its activity 
measured as FPKM of sense (mRNA) transcripts. The 
average number of antisense tags was increased up to 
4000 bp upstream of TSS, with a peak at 450 bp from 
TSS, in active (0.3-100 FPKM) promoters containing 
CpG islands (Figure 1A). Topi silencing significantly 
reduced CPT effects as antisense tags increased less in 
HCT116-shRNATopl (Figure IB), demonstrating that 
effects on antisense transcripts are strictly related to 
Topi inhibition. Box-plot analyses show that antisense 
tag increase was specific for CGI promoters, as no 
increase was detected at CGI-less promoters (Figure 1C 
and Supplementary Figure S2C). Antisense tags were not 
increased at inactive CGI promoters (0.0-0.3 FPKM; 
Figure ID and Supplementary Figure S2B) and at active 
promoters (>100 FPKM; Figure IE and F, and 
Supplementary Figure S2B), indicating that CPT effects 
were restricted to CGI promoters of intermediate activity. 
As expected, sense tags show a peak downstream of TSS 
with slight changes by CPT in HCT 1 1 6 cells (Figure 1 A and 
Supplementary Figure S2), even though CPT increased 
sense tags in Topl-silenced cells (Figure IB). No differences 
were found between protein-coding and non-protein- 
coding Refseq genes. Taken together, the data demonstrate 
that, even though CPT is a potent transcription inhibitor, 
it can markedly and specifically enhance the number of 
antisense sequence tags at human active CGI promoters. 

Toplcc-enhanced antisense transcripts are found at 
divergent active CGI promoters 

Next, to identify Toplcc-increased transcripts upstream 
to TSSs, we used the Cufflink package with stringent par- 
ameters to reduce false-positive instances, even though 
that may have reduced the number of true transcripts. 
We determined unknown (not annotated) tag clusters 
and their genome location relative to the closest annotated 



Refseq gene and found 8964 clusters long <5000nt in 
HCT116 cells, 809 of which were altered by Top lecs 
(Supplementary Table SII). In all, 2464 tag clusters were 
located within 5000 bases from a Refseq gene, the 
majority (1328) of which were promoter-associated anti- 
sense transcripts consistently with antisense transcription 
being a significant process at human promoters (46,47). Of 
the 809 clusters altered by Top lecs, a significant fraction 
(43.5%) was found within 5000 bp to a Refseq gene 
(Supplementary Table SII), suggesting that transcribed 
regions constitute a main target of CPT effects. 
Moreover, CPT-increased antisense tag clusters at pro- 
moters were the highest fraction (19.5%) of all CPT- 
altered clusters (Figure 2A and Supplementary Table 
SII). The analysis in HCT116-shRNATopl cells showed 
similar general patterns but again CPT effects were less 
marked (Figure 2A and Supplementary Table SII). 

At promoter and terminator gene regions, CPT- 
stabilized Top lecs generally increased tag clusters, with a 
significantly major effect on promoter-associated antisense 
clusters (Figure 2 A and Table 1). In particular, we identified 
256 antisense clusters increased by Toplccs in HCT116 
cells, and almost all of them localized at CGI promoters 
(Table 1 and Supplementary Table SIII and Supplementary 
Data Set 2). Silencing of Topi again decreased the CPT 
effect as we could detect fewer clusters (84) increased by 
Toplccs at CGI promoters of HCT1 16-shRNATopl cells 
(Figure 2A and Table 1 and Supplementary Data Set 3). A 
large majority of them are located up to 1 500 bp upstream of 
TSS (Supplementary Figure S3), and 33 are in common 
between the two cell lines (Figure 2B). Their median size 
was larger in HCT 11 6 than HCT 1 16-shRNATopl cells 
(853 versus 688, Supplementary Table SIV and 
Supplementary Figure S3), consistent with the observed 
antisense tag distributions upstream of TSS (Figure 1A 
and B). To determine whether the antisense transcript 
increase had an effect on sense transcription at the same 
promoter, we analyzed the expression levels of Refseq 
genes having increased antisense transcripts. These genes 
were expressed at moderate to high levels (from 0.2 to 100 
FPKM), and their expression levels were on average un- 
changed after 4h of CPT treatment (Figures IE and 2E). 
Thus, in the studied time frame, the levels of antisense tran- 
scripts do not correlate with levels of sense RNAs. 

Then, we confirmed RNA-seq results by randomly se- 
lecting 10 antisense transcripts increased by CPT at pro- 
moters and measuring their levels by rtPCR (Figure 2C). 
First, we confirmed the presence of these antisense tran- 
scripts and their CPT-dependent increase by primer- 
specific retrotranscription (Supplementary Figure S4). 
Then, we measured their fold-increases after CPT treat- 
ment demonstrating a good agreement with genomic 
sequencing analyses (Figure 2C). As a control, we also 
included one antisense transcript (5' aTNIK) that was 
decreased by CPT based on RNA-seq data (not shown). 
As expected, CPT effects were dose dependent in HCT1 16 
cells (Figure 2C). Antisense transcript increase was much 
lower in Topl-silenced cells showing a dependence 
on cellular Topi levels (Figure 2C). However, Topi 
silencing by itself did not change significantly anti- 
sense transcript levels (Supplementary Figure S5C). 
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Figure 1. Antisense sequence tags are increased by CPT at active CGI promoters in a Topl-dependent manner. (A) Tag distributions along non- 
overlapping Refseq genes of HCT116 cells were analyzed in a region from 4000 bases upstream to 1000 bases downstream the TSS (dotted line and 
arrow). Control and CPT-treated cells are in dotted and straight line, respectively. Sense and antisense tags are above and below the .v-axis, 
respectively. (B) Tag distributions in HCT1 16-shRNATopl cells, in which Topi content is reduced to about the 25% of wt levels. (C) Box-plots 
representing the variation of antisense sequence tags after CPT treatment in the indicated cell lines. Non-overlapping Refseq genes were split in two 
groups: genes with a CGI promoter (CGI, light gray), and genes without a CGI at the promoter (No-CGI, gray). (D) Box-plots representing the 
variation of antisense sequence tags after CPT treatment in HCT116 cells. Non-overlapping Refseq genes with a CGI promoter were split in two 
groups with the indicated level of gene expression (FPKM). (E) Plot of variation of antisense sequence tags after CPT treatment as a function of 
log(FPKM) in HCT116 cells, non-overlapping Refseq genes with CGI promoters. Vertical gray lines represent 0.3 and 100 FPKM value, respectively, 
from left to right. Gray dots are higher than the threshold: module (300). Genes with FPKM = 0 are not reported in the graph. All excluded genes 
had antisense variation below the threshold. (F) The same as in Panel (E), HCT1 16-shRNATopl cells. Two sample comparisons using Wilcoxon 
rank sum tests indicate that all groups are statistically different, one versus the other (HCT116 CGI versus no CGI, P = 2.4e-119; HCT-shRNATopl 
CGI versus no CGI, P= 8.6e-148; HCT116 CGI FPKM < 0.3 versus >0.3, P= 3.9e-115). 
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Figure 2. CPT induces increased levels of unknown tag clusters mainly 
at CGI promoters of genes in a Topi -dependent manner. (A) Pie charts 
representing the effect of CPT on unknown clusters at promoters of the 
indicated cell lines. Tag clusters were long <5000 nucleotides, with no 
annotation at the corresponding genomic region and localized from the 
TSS to 5000 bases upstream. Light gray, black and grey show unknown 
clusters that were increased, decreased and unchanged by CPT, respect- 
ively. (B) Common promoter-associated antisense transcripts (33) 
between HCT116 and HCT1 16-shRNATopl cells. (C) Antisense tran- 
scripts were determined upstream the TSS of selected genes by rtPCR 
in the indicated cell lines. The selected genes showed CPT-increased tag 
clusters as determined with RNA-seq data, with the exception of TNIK 
gene that had an antisense transcript reduced by CPT. (D) Antisense 
transcripts were evaluated by rtPCR in HCT116 after transient 



As HCT1 16-shRNATopl cells are a stable cell line, we 
also tested the effect of a transient silencing of Topi by 
siRNA in HCT116 cells (Supplementary Figure S6). We 
found that Topi -specific siRNAs suppressed CPT effects 
with different rates in HCT116 cells. In agreement with 
data on HCT1 16-shRNATopl (Supplementary Figure 
S5C), transient Topi silencing left substantially un- 
changed antisense transcript levels in untreated cells 
(Supplementary Figure S7). Thus, CPT effects at CGI 
promoters strictly depend on cellular Topi levels. 
Moreover, CPT increased antisense transcripts at the 
studied CGI promoters in other human cell lines, 
including PC3, U20S, COLO205 (Supplementary Figure 
S5A) and N-Tera cells (not shown), showing that the mo- 
lecular response to Toplcc is not restricted to a specific 
cell type. Taken together, these findings confirmed Req- 
seq data and demonstrated that Top Ices specifically 
increase antisense transcript levels at CGI promoters of 
intermediate activity in different human cell types. 

To characterize the promoters with Toplcc-enhanced 
antisense transcripts, we have then investigated their chro- 
matin features as determined by the ENCODE project in 
HCT116 cells (48). An analysis of gene promoter regions 
corresponding to the Toplcc-increased 256 antisense tran- 
scripts in HCT1 16 cells (Table 1 and Supplementary Table 
SIII) with WashU Epigenome Browser tools (http:// 
epigenomegateway.wustl.edu) showed that these regions 
have chromatin features of divergent promoters 
(Figure 3A). RNAPII density, H3K4m3 and H3K27ac 
levels showed two peaks upstream and downstream of 
TSS, whereas gene promoter sets with no change or a 
decrease of antisense tags showed only one peak for 
each of the three chromatin marks downstream to TSS 
(Figure 3B and Supplementary Figure S8). In all cases, 
GC percentage was mainly increased around the TSS 
and downstream of it (Figure 3 and Supplementary 
Figure S8). Although the lower and upper quartiles of 
each single box-plot showed a significant variation of 
chromatin features among the selected promoters 
(Figure 3 and Supplementary Figure S8), nevertheless 
the findings clearly demonstrated a common symmetrical 
chromatin architecture signature with two peaks of paused 
RNAPII and two domains of histone marks at gene 
5'-ends, likely related to a bi-directional (divergent) tran- 
scription activity in HCT116 cells (47,49). 

Toplcc-enhanced promoter-associated antisense 
transcripts are regulated by Cdk9 

As RNAPIIs at divergent promoters have been proposed 
to be regulated after P-TEFb recruitment (50), we have 



Figure 2. Continued 

silencing of Topi protein (black bars). In (C) and (D), PCR determin- 
ations were normalized to cytochrome b mRNA and to untreated cells 
(dotted line). Values are means +/— SEM of at least four determin- 
ations of two independent experiments. Genomic coordinates of 
analyzed antisense transcripts are reported in Supplementary Data 
Set. Primers are reported in Supplementary Table SV. (E) Correlation 
between expression levels in CPT-treated and control cells of the genes 
with CPT-increased promoter-associated antisense transcripts in 
HCT116 (256) and HCT1 16-shRNATopl (84) cell lines. 
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Table 1. Antisense transcripts are altered by CPT at CGI promoters of Refseq genes 



CPT effect Gene promoters Gene terminators 



Antisense transcripts Sense transcripts Antisense transcripts Sense transcripts 



HCT116 HCT116- HCT116 HCT116- HCT116 HCT116- HCT116 HCT116- 

shRNATopl shRNATopl shRNATopl shRNATopl 



Increased 256 a 84 a 30 17 14 21 41 51 

Decreased 3 b l b 5 3 1 0 4 3 



"248 out of the 256 (HCT116) and 80 out of the 84 (HCT1 16-shRNATopl) increased antisense transcripts at promoters are found at CGI promoters. 
b Antisense transcripts decreased by CPT (4 in total) are found at CGI promoters. 
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Figure 3. CPT exerts its effects at bidirectional promoters. (A) Analysis 
with WashU Epigenome Browser tools of the promoter region of the 
CPT-increased 256 antisense transcripts identified by Cufflinks. Shown 
chromatin features are respectively RNA Pol II density, H3K4me3 
density, H3K27ac density and GC percentage in a region from 
1000 bases upstream to 1000 bases downstream the TSS (dotted line). 
(B) Analysis with WashU Epigenome Browser tools of the promoter 
region of 249 randomly selected genes having FPKM between 1 and 
100 and that did not demonstrate in our analysis a modification in 
antisense reads after CPT treatment. 



then investigated the role of the transcriptional kinase 
Cdk9 (a subunit of P-TEFb) by directly downregulating 
it with siRNAs. Cdk9 silencing partially suppressed 
the Toplcc increase of antisense transcripts at almost 
all the tested promoters following 4h of treatment 
(Figure 4A). The observed reduction of Toplcc- 
enhanced antisense transcripts was significant considering 
that kinase silencing was partial (Supplementary Figure 
S9A), and 4h of drug treatment could allow transcript 
accumulation effects. We have then used DRB, a strong 
inhibitor of Cdk9 and RNAPII transcription elongation, 
which reduced the levels of antisense transcripts both 
in control and CPT- treated HCT116 cells (Figure 4B). 
The results therefore showed that Cdk9 activity is 
required for normal extents of antisense transcripts at 
the selected CGI promoters, likely indicating that anti- 
sense transcription is regulated by P-TEFb and Cdk9 at 
proximal-promoter RNAPII pausing step (50). 
Interestingly, even though DRB effectively reduced anti- 
sense transcripts with and without CPT treatment, we 
noticed that CPT could still increase antisense transcripts 
in the presence of DRB for almost all the studied pro- 
moters with similar fold change (Figure 4B and 
Supplementary Figure S10). We have then investigated 
whether CPT affected sense read levels downstream of 
TSS in the studied promoter set. However, we found 
no evidence of any common effect on sense transcripts 
(not shown), even though some increase of sense reads 
can be observed at some of the studied promoters 
(Figure 4C). Thus, the results show that CPT-stabilized 
Top Ices specifically affect antisense versus sense tran- 
scripts at divergent CGI promoters, likely reducing 
RNA degradation. 

Toplcc-induced increase of promoter-associated antisense 
transcripts is independent from DNA replication 

Next, we determined whether ongoing DNA replication 
is required for the observed Toplcc effects at promoters 
by stopping replication with aphidicolin, a specific inhibi- 
tor of DNA polymerases. Aphidicolin could prevent 
CPT-induced H2AX phosphorylation, a marker of 
DNA damage (Supplementary Figure S9B), showing 
that replicative DNA damage promoted by CPT was 
markedly reduced by the compound. Nevertheless, CPT 
still increased antisense transcript levels in a dose- 
dependent manner at the selected CGI promoters even 
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Figure 4. Dependence of CPT effects on antisense transcript levels on ongoing transcription. (A) The effects of 10 uM CPT for 4h on antisense 
transcription have been studied after silencing of CDK9 (white bars) in HCT116 cells. Drug treatment was performed 72 h post-transfection of 
siRNAs. Scrambled siRNA samples are black bars. (B) CPT effects in HCT116 cells were studied at the selected locus with and without DRB. Co- 
treatment: cells were pretreated for 15min with 50 uM DRB before the addition of CPT (lOuM) for 4h. In all cases, PCR determinations were 
normalized to cytochrome b mRNA and to untreated cells (dotted line). Values are means ± SEM of at least four determinations of two independent 
experiments for each panel. (C) Genomic map of the region —4000 to +1000 bp around transcription start site (TSS, bold line) of two analyzed genes 
having increased antisense transcripts induced by CPT. In each case, the top and bottom graphs reports the read distributions in HCT116 and 
HCT1 16-shRNATopl cells, respectively. In shades of blue the patterns of sense reads, in shades of red those of antisense reads (dark colors for 
control cells; light colors for CPT-treated cells). In the centre of each graph, the map from UCSC genomic browser, highlighting the chromosomal 
position of unknown cluster, PCR amplicon, CpG island and the gene 5'-end, respectively. The validation by qRT-PCR has been conducted or in a 
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simply by reads mapping. 
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Figure 5. CPT effects on antisense transcript levels are independent 
from CPT-induced DNA damage at replication forks. Dose-response 
of HCT116 cells was analyzed at the selected locus with and without 
Aphidicolin. Co-treatment: cells were pretreated for 15min with 5 uM 
Aphidicolin before the addition of CPT (1 or 10 uM) for 4h. 
(B) Antisense transcripts were evaluated after treatment of the indicated 
cell lines with TPT 10 uM for 4h. In (A) and (B), PCR determinations 
were normalized to cytochrome b mRNA and to untreated cells (dotted 
line). Values are means +/— SEM of at least four determinations of two 
independent experiments for each panel. 



in the presence of aphidicolin (Figure 5A), showing 
that Top Ices impacts promoter-associated antisense 
transcripts in a replication-independent manner. We 
also tested TPT, a CPT-derivative approved for clinical 
use and showed that it increased promoter-associated 
antisense transcripts in HCT116 cells in a dose-dependent 
manner, whereas the effect was reduced in HCT16- 
shRNATopl cells (Figure 5B). Consistently, an inactive 
CPT analog and etoposide (VP- 16, a TopII inhibitor) 
did not increase antisense RNAs in the studied 
promoters (Supplementary Figure Sll). The data there- 
fore support a specific response to Top lecs at divergent 
CGI promoters. 



CPT induces a rapid burst of Toplccs that correlates with 
a transient RNAPII block at promoters 

Our results clearly showed that CPT-induced Toplccs 
increase antisense transcript levels selectively at active di- 
vergent promoters containing a CGI. Gene Set Enrichment 
analyses (28-30) of this set of genes provided no significant 
finding beyond the generic features of housekeeping genes 
and regulatory activity in nuclear compartments. We 
however noticed that 62% of the 256 unknown tag 
clusters increased by Toplccs (Table 1) overlap with sites 
of DNase I hypersensitivity in HCT1 16 cells (elaborated at 
the UCSC Genome Browser), whereas almost the rest 
(36%) was within 3000 nt from such regions, documenting 
that they map at or close to nucleosome-free regions. 
As Topi has been mapped to open chromatin sites of pro- 
moters (22,51), we then investigated the kinetics of Top ice 
formation at the selected CGI promoters following CPT 
treatment. 

The kinetics of Top Ice formation overall in the genome 
was determined by CsCl gradient purification following an 
immediate lysis of cells with a strong detergent, to prevent 
the reversion of Toplcc. Interestingly, Toplccs are 
increased at high levels after only 2min of CPT treatment, 
and then remain stable with slight variations up to 20 min 
of CPT treatment (Figure 6A). Interestingly, the bound but 
not the free Topi molecules showed high molecular weight 
forms likely due to ubiquitination or sumoylation after 
only 2 min of drug treatment (Figure 6A), showing a 
rapid response of the cells to CPT-stabilized Toplccs. 
Then, we used a K + SDS precipitation assay of SDS cell lys- 
ates (30) to determine Toplcc levels at the selected CGI 
promoters. Along with them, we investigated silent 
promoter regions, centromeric a-satellite DNA [a 
negative region for Topi binding, see (28)], and a known 
Topi binding site at the origin of replication located at the 
3' end of the human lamin B2 gene (52). As expected, the 
findings showed that the lamin B2 region was enriched 
> 20-fold in K + SDS precipitates over a-satellite DNA 
(Figure 6D). We then investigated TSS-proximal regions 
of 7 active promoters (GAPDH, POLR2A, MYC, 
VEGFA, SP2, TDG, ATF1), and the DNA recovery in 
the precipitates showed a bell-shaped curve over time 
of CPT treatment with an initial increase at short times 
(0.5-5 min) followed by a marked decrease at later times 
(10-20 min) (Figure 6D). CPT increased DNA recovery 
from 2- to 4-fold with respect to untreated HCT116 cells. 
The increase was instead much lower in HCT116- 
shRNATopl cells (< 1.5-fold), consistent with the 
reduced cellular Topi levels. CPT did not increase DNA 
recovery at promoters of three silent genes (M YOD 1 , LOR 
and FOXA2 in Figure 6D) showing that Topi is at low 
levels, if any, at inactive promoters. We then looked at 
regions in the body of MYC and HIF-la genes, and the 
findings showed similar kinetics of Toplcc formation by 
CPT at these sites as well (Figure 6E). 

The unexpected DNA recovery kinetics in K + SDS pre- 
cipitates and CsCl gradient results demonstrated that CPT 
could immediately induce the formation of Toplccs 
followed by their rapid removal from active promoters. 
We then determined kinetics of RNAPII density during 
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Figure 6. CPT determines a rapid and transient burst of Toplccs that correlates with transient RNA Pol II block. (A) ICE bioassay was performed 
on HCT116 cells after 2, 5 and 20 min of 10 uM CPT treatment. The gel shows both CsCl gradient fractions containing DNA bound proteins (on the 
left) and free proteins (on the right). P-Actin was used as loading control. (B) ChIP on RNA Pol II. HCT116 cells were treated for 5, 10, 20 and 
30 min with CPT 10 uM and the DNA recovery at indicated gene promoters is plotted as fold increase on not treated cells against treatment time. 
The analyzed gene regions are on TSS or immediately downstream to it and are listed (C). (C) List of genes observed in ChIP and K + SDS Assays 
[(B) and (D) panels], with the relative FPKM obtained by RNA-Seq. Used primers are reported in Supplementary Table SVI. (D) K + SDS precipi- 
tation assay on HCT116 and HCT1 16-shRNATopl cells. Cells were treated for 2, 5, 10 and 20min with 10uM CPT. The recovery of Toplccs at 
indicated gene promoters is reported as fold increase or percentage on input on not treated cells against treatment time. Used primers are in 
Supplementary Table SVI. (E) K + SDS precipitation assay on HCT116 in different positions along MYC and HIF-la genes. Below the DNA 
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as follows: 37% of the mean value (panel B); 19 and 37% of mean values for top and lower graphs (panel D HCT116), respectively; 16 and 23% of 
mean values for top and lower graphs (panel D HCT1 16-shRNATopl), respectively. 



CPT treatment at the same promoters with ChIP 
(Figure 6B). Interestingly, at active promoters with a high 
rate of RNAPII recruitment, the results show an accumu- 
lation of the polymerase followed by a reduction of it during 
drug treatment (Figure 6B). The timing of the bell-shaped 
curve correspond to the timing of Topi cc burst, suggesting 
that CPT-trapped Toplccs at promoter sites can block 
polymerase escape from promoter regions, and their 
removal can eventually release RNAPII allowing pausing 
escape (31). 

CPT transiently increases R-loops in whole cells 
paralleling the bursts of Toplccs 

As a rapid increase of CPT-induced Toplccs may affect 
DNA supercoiling at transcriptionally active genes (51), 



we also investigated the formation of R-loop structures 
(favored by negatively supercoiled DNA) with an 
antibody against the DNA/RNA hybrid duplex (53) by 
immunofluorescence and cell imaging in HCT116 and 
HCT1 16-shRNATopl cells. The kinetics of R-loop forma- 
tion appear to parallel the burst of Toplccs reported earlier 
in the text (Figure 6), as R-loops were significantly increased 
by 2-10 min of CPT treatment while they were reduced at 
longer (1 h) times (Figure 7A and B) in HCT1 16 cells. The 
fluorescence signal was mainly increased in nucleoli and at 
cytoplasmic spots localized at the periphery of the nucleus 
(Figure 7A). These cytoplasmic spots likely correspond to 
mitochondria where R-loops are present at the DNA origin 
of replication, as evidenced by mitochondria-specific 
staining in the same cells (Supplementary Figure S12). 
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After 1 h of CPT, fluorescence distribution was further 
changed with a single (or few) prominent spot(s) detected 
in the nucleus (Figure 7), suggesting that R loop structures 
last longer at some genomic regions. Interestingly, 
the findings in HCT1 16-shRNATopl cells were different. 
First, in control cells, fluorescence was higher in HCT1 16- 
shRNATopl than HCT116 cells, and in particular, nucle- 
olar signal was marked (Figure 7A and B). Second, CPT 
did not increase the R-loop signal in HCT1 16- 
shRNATopl cells (Figure 7A and B). However, at longer 
time fluorescence intensity decreased, and patterns were 
more similar in both cell lines. Thus, cell images show that 
CPT can induce a rapid and transient increase of R-loops 
in a Top 1 -dependent manner, likely due to negatively super- 
helical tension of DNA template at actively transcribed 
regions, such as ribosomal genes. 

DISCUSSION 

The present findings demonstrate for the first time that 
Top Ices have a specific and dynamic impact at divergent 



CGI promoters in human cells. CPT-stabilized Toplccs 
enhance levels of antisense RNAPII transcripts upstream 
of TSS at divergent CGI promoters of intermediate 
activity. The CPT effects are dependent on cellular Topi 
levels, and independent from DNA replication. Toplccs 
immediately form at transcribing regions, but then their 
levels decrease after only 10 min of drug treatment. The 
bell-shaped Toplcc kinetic parallels a similar transient 
increase of RNAPII density at promoters, indicating that 
Toplccs transiently block the polymerase. Toplccs are 
stabilized not only by CPT but also by oxidative base 
damage, base mismatches and ribonucleotides incor- 
porated into DNA strands (14,54). Thus, Toplccs may 
physiologically occur in living cells at a significant rate 
and may cause specific dynamical changes leading to alter- 
ations of RNAPII recruitment and antisense transcripts. 

Divergent promoters have been reported in several eu- 
karyotic cells (46,47,50,55-57) raising the question of how 
this process contributes to regulation of gene activity 
and chromatin structure. It is therefore interesting that 
the majority of TSSs in murine embryonic stem cells 
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show evidence of bidirectional transcription (47). Previous 
results showed that divergent RNAPIIs are regulated by 
P-TEFb at post-initiation pausing sites and that antisense 
transcript levels are controlled by the exosome (50). Our 
results are in agreement with a role for Cdk9 (a P-TEFb 
subunit) in antisense transcription, as Cdk9 inhibition 
and silencing can reduce the cellular levels of antisense 
transcripts. We have previously shown that CPT causes a 
specific transcriptional stress with RNAPII hyperphos- 
phorylation, increased RNAPII escape from promoter 
pausing sites, chromatin accessibility marks and antisense 
transcripts at human HIF-la and c-Myc genes (31). 
Moreover, inhibition of Cdk9 and Cdk7 activity by DRB 
could suppress the CPT-induced RNAPII escape and 
hyperphosphorylation and increased chromatin accessibil- 
ity. However, we show here that DRB is not able to sup- 
press the CPT-induced increase of antisense transcript 
levels at the studied divergent CGI promoters. As DRB is 
an effective transcription inhibitor and it does not suppress 
Top Ices induced by CPT (14), we therefore propose that 
the CPT effect may mainly be due to a reduction of RNA 
degradation. This hypothesis is also supported by the 
finding that reducing the RNA degradation pathway by 
silencing exosome subunits can enhance promoter- 
associated antisense transcripts in human cells (58). 

A recent article (51) showed that human promoters of 
active genes constitute topological domains of negative 
supercoiling mainly due to the short range of torsional 
stress dissipation. As Top2 and Topi can be functionally 
redundant, the authors found that Topi is mainly respon- 
sible for supercoil relaxation at promoters of intermediate 
activity, whereas Top2 becomes more important at highly 
active promoters (51). Consistently, we have here found 
that Top lecs increase antisense transcripts at intermedi- 
ately active divergent CGI promoters, whereas we did not 
detect a similar effect by inhibiting TopII with etoposide. 
Thus, the specific increase of antisense transcripts by CPT 
likely occurs where Topi constitutes the main DNA relax- 
ation activity. We cannot however exclude that TopII 
inhibitors may alter antisense transcript levels in a differ- 
ent set of gene promoters. 

In addition, CPT has been shown to increase negative 
supercoils at promoters of intermediate activity after 5 min 
of cell treatment (51). In bi-directional promoters, the 
rate of transcription-generated negative torsional tension 
is necessarily higher as divergent RNAPIIs elongate in 
different directions. Thus, divergent CGI promoters may 
constitute a set of gene control regions highly sensitive to 
exogenous and endogenous perturbation of Topi activity. 
CGI promoters may readily respond to Top lecs by an 
increase of negative supercoils leading to strand denatur- 
ation and formation of non-B structures. Interestingly, 
TOPI gene deletion has been shown to cause an increase 
of R-loops at actively transcribed rRNA genes in yeast, 
leading to an accumulation of slowed RNAPs I at the 5' 
end of the gene (59). 

An important finding of the present work is the demon- 
stration that CPT induces a stabilization of R loops at 
nucleoli and mitochondria in HCT116 cells with a 
kinetic that closely parallels Toplcc formation 
(Figures 6 and 7). Interestingly, cells with reduced Topi 



levels (HCT116-shRNATopl) show higher levels of R 
loops at nucleoli and whole cells without CPT treatment. 
This finding is consistent with a previous investigation 
showing that overexpression of RNaseH can reduce 
genomic instability and replication/transcription conflicts 
due to Topi silencing (35). R-loops can form physiologic- 
ally at sequences showing a G-richness in one strand, 
known as GC skew (53). We noticed that 185 of 246 
promoter-associated antisense transcripts overlap with or 
are within 3000 bp distant from a GC skewed region. In 
addition, some of our antisense transcripts overlap to 
DNA segments known to form R-loops in untreated 
human cells (53). Thus, we propose that Toplccs may 
cause an increase of negative supercoils at divergent 
CGI promoters determining a stabilization of R-loops, 
and then trigger an accumulation of promoter-associated 
antisense transcripts. Even though further studies are 
needed to establish such hypothesis, Toplccs-dependent 
genome instability may preferentially occur at divergent 
CGI promoters of intermediate activity. 

Toplcc-triggered antisense transcripts may constitute a 
specific molecular response to Toplccs in normal and 
stress conditions. It remains to be established if they 
play a role in the maintenance of vital cellular processes 
under stress conditions such as increased levels of 
Toplccs. Antisense transcripts have been shown to con- 
tribute to the regulation of gene expression regulation. We 
recently showed that the 5' aHIFla antisense RNA of the 
human HIF-la gene may have a role in the nucleus- 
cytoplasm transport machinery (33). In addition, 
accumulated antisense transcripts may interact with 
RNAPII or transcription factors to regulate chromatin 
structure, recruitment and sense transcription. This has 
been shown for transcripts upstream of the cyclin Dl 
gene that recruit TLS, a RNA-binding factor sensitive 
to DNA damage (60), and of Polycomb target genes 
that bind to Suzl2 to repress gene transcription (61). 
Thus, antisense transcripts may regulate gene expression 
through different mechanisms at divergent CGI promoters 
in response to Toplccs. However, the definition of 
their functions remains to be established by further 
investigations. 
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